Extended Tree Transducers in Natural Language Processing

نویسنده

  • Andreas Maletti
چکیده

Tree transducers are finite-state devices computing relations on trees. Their study was initiated by Thatcher (1970) and Rounds (1970), who established the classical top-down tree transducers that process the input tree from the root towards the leaves. Shortly afterwards, Baker (1973) introduced the bottom-up tree transducers that process the input tree from the leaves towards the root in analogy to the top-down and bottom-up tree automata (Thatcher, 1973). Due to applications in syntax-directed semantics (Fülöp and Vogler, 1998), tree transducers were extensively studied in the following years as detailed in (Gécseg and Steinby, 1984) and (Gécseg and Steinby, 1997). Notable extensions to the original top-down tree transducers include the top-down tree transducers with regular look-ahead by Engelfriet (1977), the attributed tree transducers of Fülöp (1981), and the macro tree transducers by Courcelle and Franchi-Zannettacci (1982) and Engelfriet and Vogler (1985). In statistical machine translation (Koehn, 2009), syntax-based models (Chiang, 2010) [i.e., models that translate from or to syntax trees] have recently seen a lot of progress. It was identified already by Eisner (2003) that the classical linear top-down and bottom-up tree transducers cannot properly handle phenomena (such as rotation) that occur during the translation between natural languages. This result was presented for synchronous contextfree grammars [SCFG] (Chiang, 2006), which is a formalism similar in spirit to (and essentially equally expressive as) the syntax-directed translation schemata by Aho and Ullman (1969), which were later refined to the more general bimorphism approach by Arnold and Dauchet (1982). Instead Eisner (2003) proposes synchronous tree

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Power of Extended Top-Down Tree Transducers

Extended top-down tree transducers (transducteurs g en eralis es descendants [Arnold, Dauchet: Bi-transductions de forêts. ICALP'76. Edinburgh University Press. 1976]) received renewed interest in the eld of Natural Language Processing. Here those transducers are extensively and systematically studied. Their main properties are identi ed and their relation to classical top-down tree transducers...

متن کامل

Survey: Weighted Extended Top-down Tree Transducers Part I - Basics and Expressive Power

Weighted extended top-down tree transducers (transducteurs généralisés descendants [Arnold, Dauchet: Bi-transductions de forêts. ICALP’76. Edinburgh University Press. 1976]) received renewed interest in the field of Natural Language Processing, where they are used in syntax-based machine translation. This survey presents the foundations for a theoretical analysis of weighted extended top-down t...

متن کامل

Applications of Weighted Automata in Natural Language Processing

Linguistics and automata theory were at one time tightly knit. Very early on, finite-state processes were used by Markov [35, 27] to predict sequences of vowels and consonants in novels by Pushkin. Shannon [48] extended this idea to predict letter sequences of English words using Markov processes. While many theorems about finite-state acceptors (FSAs) and finite-state transducers (FSTs) were p...

متن کامل

An Overview of Probabilistic Tree Transducers for Natural Language Processing

Probabilistic finite-state string transducers (FSTs) are extremely popular in natural language processing, due to powerful generic methods for applying, composing, and learning them. Unfortunately, FSTs are not a good fit for much of the current work on probabilistic modeling for machine translation, summarization, paraphrasing, and language modeling. These methods operate directly on trees, ra...

متن کامل

Efficient Inference through Cascades of Weighted Tree Transducers

Weighted tree transducers have been proposed as useful formal models for representing syntactic natural language processing applications, but there has been little description of inference algorithms for these automata beyond formal foundations. We give a detailed description of algorithms for application of cascades of weighted tree transducers to weighted tree acceptors, connecting formal the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015